1 50
EG327 MNKIYRIIWN SALNAWVAVS ELTRNHTKRA SATVATAVLA TLLFATVQAS
BZ198 MNKIYRIIWN SALNAWVVVS ELTRNHTKRA SATVATAVLA TLLFATVQAN
BZ10 MNKISRIIWN SALNAWVVVS ELTRNHTKRA SATVATAVLA TLLFATVQAN
H15 MNKIYRIIWN SALNAWVVVS ELTRNHTKRA SATVATAVLA TLLFATVQAN
EG329 MNEILRIIWN SALNAWVVVS ELTRNHTKRA SATVKTAVLA TLLFATVQAS
PMC21 MNKIYRIIWN SALNAWVVVS ELTRNHTKRA SATVKTAVLA TLLFATVQAS
H38 MNKIYRIIWN SALNAWVAVS ELTRNHTKRA SATVKTAVLA TLLFATVQAN
P20 MNKIYRIIWN SALNAWVVVS ELTRNHTKRA SATVATAVLA TLLSATVQAN
Z2491 MNKIYRIIWN SALNAWVAVS ELTRNHTKRA SATVKTAVLA TLLFATVQAN
H41 MNKIYRIIWN SALNAWVAVS ELTRNHTKRA SATVKTAVLA TLLFATVQAN
Consensus MN-I-RIIWN SALNAWV-VS ELTRNHTKRA SATV-TAVLA TLL-ATVQA-
EG327 TTDDD ♦ * .DL YLEFVQRTAV VLSFRSDKBG TGEKE.VTED 
B2X98 ATDDD ♦ . .t>L YLEPVQRTAV VLSFRSDKEG TGEKS.GTED 
ATDDD. ♦ * DL YLEPVQRTAV VLSFRSDKEG TGEKE-GTSD 

HIS ATDDD. . *DL YLEPVQRTAV VLSFRSDKEG TGEKE.GTED 
EG329 ANNEEQEEDL YLDPVLRTVA VLIVNSDKEG TGEKEKVEEN 
PMC21 ANNEEQEEDL YLDPVQRTVA VLIVNSDKEG TGEKEKVEEN 

H3S ATDED . - BEE ELEPWRSAL VLQFKIDKEG NGENE.STGN 

P20 ATDTD..EDE ELESVARSAL VLQFMIDKEG NGEIESTGDI 
Z2491 ATDED.. EEE ELESVQR.SV VGSIQASMEG SGELET . ♦ . I 

H41 ATDED. . EEE ELESVQR. SV VGS IQASMEG SVELET. . -I 

Consensus — < L — v-ft- V EG — e-e 
.... FTYSLK
H38
Z2491
151 - 200

EG327 TEKLSFSANS NKVNITSDTK GLNFAKKTAE TNGDTTVHLN GIGSTLTDTL

BZ198 TEKLSFGANG NKVNITSDTK GLNFAKETAG TNGDPTVHLN GIGSTLTDTL

BZ10 TEKLSFGANG NKVNITSDTK GLNFAKETAG TNGDPTVHLN GIGSTLTDTL

H15 TEKLSFGANG NKVNITSDTK GLNFAKETAG TNGDPTVHLN GIGSTLTDTL

EG329 TEKLSFSANG NKVNITSDTK GLNFAKETAG TNGDTTVHLN GIGSTLTDTL

PMC21 TEKLSFSANG NKVNITSDTK GLNFAKETAG TNGDTTVHLN GIGSTLTDTL

H38 TEKLSFGANG NKVNITSDTK GLNFAKETAG TNGDTTVHLN GIGSTLTDTL

P20 TEKLSFGANG NKVNITSDTK GLNFAKETAG TNGDPTVHLN GIGSTLTDTL

Z2491 TEKLSFGANG KKVNIISDTK GLNFAKETAG TNGDTTVHLN GIGSTLTDTL

H41 TEKLSFGANG KKVNIISDTK GLNFAKETAG TNGDTTVHLN GIGSTLTDML

Consensus TEKLSF-AN- -KVNI-SDTK GLNFAK-TA- TNGD-TVHLN GIGSTLTD-L
LNTGATTNVT
RAASIKDVLN AGWNIKGVKT
RAASVKDVLN AGWNIKGVKP 
RAAS-KDVLN AGWNIKGVK-
GTTAS . . DNV
GTTAS . . DNV
GTTAS . . DNV 
GTTAS . . DNV 
GTTAS . . DNV 
GTTAS . . DNV 
GTTAS . . DNV 
GSTTGQSENV 
GTTAS . . DNV
G-T -NV 
EG327 DFVRTYDTVE FLSADTKTTT VNVESKDNGK RTEVKIGAKT

BZ198 DFVRTYDTVE FLSADTKTTT VNVESKDNGK KTEVKIGAKT

BZ10 DFVRTYDTVE FLSADTKTTT VNVESKDNGK RTEVKIGAKT

H15 DFVRTYDTVE FLSADTKTTT VNVESKDNGK KTEVKIGAKT

EG329 DFVRTYDTVE FLSADTKTTT VNVESKDNGK KTEVKIGAKT

PMC21 DFVRTYDTVE FLSADTKTTT VNVESKDNGK KTEVKIGAKT

H38 DFVHTYDTVE FLSADTKTTT VNVESKDNGK RTEVKIGAKT

P20 DFVRTYDTVE FLSADTKTTT VNVESKDNGK RTEVKIGAKT

Z2491 DFVRTYDTVE FLSADTKTTT VNVESKDNGK RTEVKIGAKT

H41 DFVRTYDTVE FLSADTKTTT VNVESKDNGK KTEVKIGAKT

Consensus DFV-TYDTVE FLSADTKTTT VNVESKDNGK -TEVKIGAKT
SVIKEKDGKL
EG327 VTGKDKGEND SSTDKGEGLV TAKEVIDAVN KAGWRMKTTT

BZ198 VTGKGKDENG SSTDEGSGLV TAKEVIDAVN KAGWRMKTTT

BZ10 VTGKGKGENG SSTDEGEGLV TAKEVIDAVN KAGWRMKTTT

H15 VTGKGKDENG SSTDEGEGLV TAKEVIDAVN KAGWRMKTTT

EG329 VTGKDKGENG SSTDEGEGLV TAKEVIDAVN KAGWRMKTTT

PMC21 VTGKDKGENG SSTDEGEGLV TAKEVIDAVN KAGWRMKTTT

H38 VTGKGKGENG SSTDEGEGLV TAKEVIDAVN KAGWRMKTTT

P20 VTGKGKGENG SSTDEGEGLV TAKEVIDAVN KAGWRMKTTT

Z2491 VTGKGKGENG SSTDEGEGLV TAKEVIDAVN KAGWRMKTTT

H41 VTGKGKGENG SSTDEGEGLV TAKEVIDAVN KAGWRMKTTT

Consensus VTGK-K-EN- SSTD-GEGLV TAKEVIDAVN KAGWRMKTTT
EG327 IATSMTPQFS SVSLGAGADA PTLSVDDEGA LNVGSKDANK

BZ198 IATSMAPQFS SVSLGAGADA PTLSVDDEGA LNVGSKDTNK

BZ10 IATSMTPQFS SVSLGAGADA PTLSVDDEGA LNVGSKDANK

H15 IATSMTPQFS SVSLGAGADA PTLSVDDEGA LNVGSKDANK

EG329 IATSMTPQFS SVSLGAGADA PTLSVDG.DA LNVGSKKDNK

PMC21 IATSMTPQFS SVSLGAGADA PTLSVDG.DA LNVGSKKDNK

H38 IATSMTPQFS SVSLGAGADA PTLSVDDKGA LNVGSKDANK

P20 IATSMTPQFS SVSLGAGADA PTLSVDDEGA LNVGSKDANK

Z2491 IATSMAPQFS SVSLGAGADA PTLSVDDEGA LNVGSKDANK

H41 IATSMTPQFS SVSLGAGADA PTLSVDDEGA LNVGSKDANK

Consensus IATSM-PQFS SVSLGAGADA PTLSVD---A LNVGSK--NK
CAGCCAGAGA
CAGCCAGAGA 
CAfiCCGGAAC 



AATCACCCTC 
AATCACCCTC 
AATCACCCTC 
CGTTACCCTC 
CGTTACCCTC 
,GTTACCCTC 
AGTTACCCTC 
AATCACCCTC 
AATCACCCTC 
AATCACCCTC 

--T-ACCCTC



AAAGCCGGCG 
AAAGCCGGCG 
AAAGCCGGCG 
AAAGCCGGCG 
AAAGCCGGCG 
AAAGCCGGCG 
AAAGCCGGCG 
AAAGCCGGCG 
AAAGCCGGCG 
AAAGCCGGCG 
AAAGCCGGCG 



C2 



350 

ACAACCTGAA 
ACAACCTGAA 
ACAACCTGAA 
ACAACCTGAA 
ACAACCTGAA 
ACAACCTGAA 
ACAACCTGAA 
ACAACCTGAA 
ACAACCTGAA 
ACAACCTGAA 

ACAACCTGAA



351 420 
H15 AATCAAACAA AACACCAATG AAAACACCAA TGAAAACACC AATGACAGTA GCTTCACCTA CTCCCTGAAA
BZ10 AATCAAACAA AACACCAATG AAAACACCAA TGAAAACACC AATGACAGTA GCTTCACCTA CTCCCTGAAA

BZ198 AATCAAACAA AACACCAATG AAAACACC AATCACAGTA GCTTCACCTA CTCCCTGAAA 

P20 AATCAAACAA AGCGGCAAAG A -CTTCACCTA CTCGCTGAAA 

H38 AATCAAACAA AACACCAATA AAAACACCAA TGAAAACACC AATGACAGTA GCTTCACCTA CTCGCTGAAA 

22491 AATCAAACAA AACACCAATG AAAACACC. . , , AATGCCAGTA GCTTCACCTA CTCGCTGAAA 

H41 AATCAAACAA AACACCAATG AAAACACC * AATGCCAGTA GCTTCACCTA CTCGCTGAAA 

EG329 AATCAAACAA AAC. G. . GCACAA ACTTCACCTA CTCGCTGAAA 

PMC21 AATCAAACAA AAC . . . ~ % ,G. . . * GCACAA ACTTCACCTA CTCGCTGAAA 

EG327 AATCAAACAA AACACCAATG AAAACACC . AATGCCAGTA GCTTCACCTA CTCGCTGAAA 

Consensus AftT<frA ACAA - A-C r CTTCACCTA ^C,-CTGfflA 

C2 V2 C3 



H15
BZ10
BZ198

P20

H38
Z2491

H41
EG329
PMC21
EG327
Consensus



421 

AAAGACCTCA 
AAAGACCTCA 
AAAGACCTCA 
AAAGAGCTGA 
AAAGACCTCA 
AAAGACCTCA 
AAAGACCTCA 
AAAGACCTCA 
AAAGACCTCA 
AAAGACCTCA 
AAAGA-CT-A 



CAGATCTGAC 
CAGATCTGAC 
CAGATCTGAC 
AAGACCTGAC 
CAGATCTGAC 
CAGGCCTGAT 
CAGGCCTGAT 
CAGATCTGAC 
CAGATCTGAC 
CAGATCTGAC
-AG--CTGA-



CAGTGTTGAA 
CAGTGTTGAA 
CAGTGTTGAA 
CAGTGTTGAA 
CAGTGTTGAA 
CAATGTTGAA 
CAATGTTGAA 
CAGTGTTGGA 
CAGTGTTGGA 
CAGTGTTGGA 
CA-TGTTG-A 



ACTGAAAAAT 
ACTGAAAAAT 
ACTGAAAAAT
ACTGAAAAAT 
ACTGAAAAAT 
ACTGAAAAAT 
ACTGAAAAAT 
ACTGAAAAAT 
ACTGAAAAAT 
ACTGAAAAAT 
ACTGAAAAAT



TATCGTTTGG 
TATCGTTTGG 
TATCGTTTGG 
TATCGTTTGG 
TATCGTTTGG 
TATCGTTTGG 
TATCGTTTGG 
TATCGTTTAG 
TATCGTTTAG 
TATCGTTTAG 
TATCGTTT-G



C3 



CGCAAACGGT 
CGCAAACGGT 
CGCAAACGGT 
CGCAAACGGT 
CGCAAACGGC 
CGCAAACGGC 
CGCAAACGGC 
CGCAAACGGC
CGCAAACGGC
CGCAAACAGC
CGCAAAC-G-



490 

AATAAAGTCA
AATAAAGTCA

AATAAAGTCA 
AATAAAGTCA 
AATAAAGTCA 
AAGAAAGTCA 
AAGAAAGTCA 
AATAAAGTCA 
AATAAAGTCA 
AATAAAGTCA 
AATAAAGTCA 



H15
BZ10
BZ198

P20

H38
Z2491

H41
EG329
PMC21
EG327
Consensus



491 

ACATCACAAG 
ACATCACAAG 
ACATCACAAG 
ACATCACAAG 
ACATCACAAG 
ACATCATAAG 
ACATCATAAG 
ACATCACAAG 
ACATCACAAG 
ACATCACAAG 
ACATCA-AAG 



CGACACCAAA 
CGACACCAAA 
CGACACCAAA 
CGACACCAAA 
CGACACCAAA 
CGACACCAAA 
CGACACCAAA 
CGACACCAAA 
CGACACCAAA 
CGACACCAAA 
CGACACCAAA 



GGCTTGAATT 
GGCTTGAATT 
GGCTTGAATT 
GGCTTGAATT 
GGCTTGAATT 
GGCTTGAATT 
GGCTTGAATT 
GGCTTGAATT 
GGCTTGAATT 
GGCTTGAATT 
GGCTTGAATT



TTGCGAAAGA 
TTGCGAAAGA
TTGCGAAAGA 
TTGCGAAAGA 
TCGCGAAAGA 
TCGCGAAAGA 
TCGCGAAAGA 
TTGCGAAAGA 
TTGCGAAAGA 
TCGCGAAAAA 
T-GCGAAA-A



C3 



AACGGCTGGG 
AACGGCTGGG 
AACGGCTGGG 
AACGGCTGGG 
AACGGCTGGG 
AACGGCTGGG 
AACGGCTGGG 
AACGGCTGGG 
AACGGCTGGG 
AACGGCTGAG 
AACGGCTG-G



ACGAACGGCG 
ACGAACGGCG 
ACGAACGGCG 
ACGAACGGCG 
ACGAACGGCG 
ACGAACGGCG 
ACGAACGGCG 
ACGAACGGCG 
ACGAACGGCG 
ACCAACGGCG 
AC-AACGGCG



560 

ACCCCACGGT
ACCCCACGGT
ACCCCACGGT
ACCCCACGGT
ACACCACGGT
ACACCACGGT
ACACCACGGT

ACACCACGGT
ACACCACGGT
ACACCACGGT
AC-CCACGGT



H15
BZ10
BZ198

P20

H38
Z2491

H41
EG329
PMC21
EG327
Consensus



TCATCTGAAC 
TCATCTGAAC 
TCATCTGAAC 
TCATCTGAAC 
TCATCTGAAC 
TCATCTGAAC 
TCATCTGAAC 
TCATCTGAAC 
TCATCTGAAC 
TCATCTGAAC 
TCATCTGAAC 



GGTATCGGTT 
GGTATCGGTT 
GGTATCGGTT 
GGTATCGGTT 
GGTATTCGTT 
GGTATCGGTT 
GGTATCGGTT 
GGTATTGGTT 
GGTATTGGTT 
GGTATCGGTT 
GGTAT-GGTT

C3 



CGACTTTGAC 
CGACTTTGAC 
CGACTTTGAC 
CGACTTTGAC 
CGACTTTGAC 
CGACTTTGAC 
CGACTTTGAC 
CGACTTTGAC 
CGACTTTGAC 
CGACTTTGAC 
CGACTTTGAC 



CGATACGCTG 
CGATACGCTG 
CGATACGCTG 
CGATACGCTT 
CGATACGCTG 
CGATACGCTT 
CGATATGCTG 
CGATACGCTG 
CGATACGCTG 
CGATACGCTG 
CGATA-GCT-



CTGAATACCG 
CTGAATACCG 
CTGAATACCG 
GCGGGTTCTT 
CTGAATACCG 
GCGGGTTCTT 
CTGAATACCG 
CTGAATACCG 
CTGAATACCG 
CTGAATACCG 



GAGCGACCAC 
GAGCGACCAC 
GAGCGACCAC 
CTGCTTCTCA 
GAGCGACCAC 
CTGCTTCTCA 
GAGCGACCAC 
GAGCGACCAC 
GAGCGACCAC 
GAGCGACCAC 



630

AAACGTAACC 
AAACGTAACC 
AAACGTAACC 
CGTTGATGCG 
AAACGTAACC 
CGTTGATGCG 
AAACGTAACC 
AAACGTAACC 
AAACGTAACC 
AAACGTAACC 
— G C- 



V3 



FIG. 2cont 



H15
BZ10
BZ198

P20

H38
Z2491

H41
EG329
PMC21
EG327
Consensus



631 

AACGACAACG 
AACGACAACG 
AACGACAACG 
GGTAACCAAA 
AACGACAACG 
GGTAACCAAA 
AACGACAACG 
AACGACAACG 
AACGACAACG 
AACGACAACG 
— "AC-A." 



TTACCGATGA 
TTACCGATGA 
TTACCGATGA 
GTACACATTA 
TTACCGATGA 

TTACCGATGA 
TTACCGATGA 
TTACCGATGA 
TTACCGATGA 
-TAC--AT-A



CGAGAAAAAA 
CGAGAAAAAA 
CGAGAAAAAA 
C......ACT
CAAGAAAAAA 

C......ACT

CGAGAAAAAA 
CGAGAAAAAA 
CGAGAAAAAA 
CGAGAAAAAA 
C A— 



CGTGCGGCAA 
CGTGCGGCAA 
CGTGCGGCAA 
CGTGCAGCAA 
CGTGCGGCAA 
CGTGCAGCAA 
CGTGCGGCAA 
CGTGCGGCAA 
CGTGCGGCAA 
CGTGCGGCAA 
CGTGC-GCAA 



V3 



GCGTTAAAGA 
GCGTTAAAGA 
GCGTTAAAGA 
GTATTAAGGA 
GCGTTAAAGA 
GTATTAAGGA 
GCGTTAAAGA 
GCGTTAAAGA 
GCGTTAAAGA 
GCGTTAAAGA 
G--TTAA-GA



CGTATTAAAC 
CGTATTAAAC 
CGTATTAAAC 
TGTGTTGAAT 
CGTATTAAAC 
TGTGTTGAAT 
CGTATTAAAC 
CGTATTAAAC 
CGTATTAAAC 
CGTATTAAAC 
-GT-TT-AA- 



C4 



700 

GCAGGCTGGA 
GCAGGCTGGA 
GCAGGCTGGA 
GCGGGTTGGA 
GCAGGCTGGA 
GCGGGTTGGA 
GCAGGCTGGA 
GCAGGCTGGA 
GCTGGCTGGA 
GCAGGCTGGA 
GC-GG-TGGA



701 

H15 ACATTAAAGG CGTTAAACCC

BZ10 ACATTAAAGG CGTTAAACCC

BZ198 ACATTAAAGG CGTTAAACCC

P20 ATATTAAGGG TGTTAAAACT

H38 ACATTAAAGG CGTTAAACCC

Z2491 ATATTAAGGG TGTTAAAACT

H41 ACATTAAAGG CGTTAAACCC

EG329 ACATTAAAGG CGTTAAACCC

PMC21 ACATTAAAGG CGTTAAACCC

EG327 ACATTAAAGG CGTTAAACCC

Consensus A-ATTAA-GG -GTTAAA-C-



C4 



770 

GGTACAACAG CT......TC CGATAACGTT GATTTCGTCC GCACTTACGA

GGTACAACAG CT......TC CGATAACGTC GATTTCGTCC GCACTTACGA

GGTACAACAG CT......TC CGATAACGTT GATTTCGTCC GCACTTACGA

GGCTCAACAA CTGGTCAATC AGAAAATGTC GATTTCGTCC GCACTTACGA

GGTACAACAG CT......TC CGATAACGTT GATTTCGTCC ACACTTACGA

GGCTCAACAA CTGGTCAATC AGAAAATGTC GATTTCGTCC GCACTTACGA

GGTACAACAG CT......TC CGATAACGTT GATTTCGTCC GCACTTACGA

GGTACAACAG CT......TC CGATAACGTT GATTTCGTCC GCACTTACGA

GGTACAACAG CT......TC CGATAACGTT GATTTCGTCC GCACTTACGA

GGTACAACAG CT......TC CGATAACGTT GATTTCGTCC GCACTTACGA

GG--CAACA- CT------TC -GA-AA-GT- GATTTCGTCC -CACTTACGA



V4 



C5 



771 840 

H15 CACAGTCGAG TTCTTGAGCG CAGATACGAA AACAACGACT GTTAATGTGG AAAGCAAAGA CAACGGCAAG

BZ10 CACAGTCGAG TTCTTGAGCG CAGATACGAA AACAACGACT GTTAATGTGG AAAGCAAAGA CAACGGCAAG

BZ198 CACAGTCGAG TTCTTGAGCG CAGATACGAA AACAACGACT GTTAATGTGG AAAGCAAAGA CAACGGCAAG

P20 CACAGTCGAG TTCTTGAGCG CAGATACGAA AACAACGACT GTTAATGTGG AAAGCAAAGA CAACGGCAAG

H38 CACAGTCGAG TTCTTGAGCG CAGATACGAA AACAACGACT GTTAATGTGG AAAGCAAAGA CAACGGCAAG

Z2491 CACAGTCGAG TTCTTGAGCG CAGATACGAA AACAACGACT GTTAATGTGG AAAGCAAAGA CAACGGCAAG

H41 CACAGTCGAG TTCTTGAGCG CAGATACGAA AACAACGACT GTTAATGTGG AAAGCAAAGA CAACGGCAAG

EG329 CACAGTCGAG TTCTTGAGCG CAGATACGAA AACAACGACT GTTAATGTGG AAAGCAAAGA CAACGGCAAG

PMC21 CACAGTCGAG TTCTTGAGCG CAGATACGAA AACAACGACT GTTAATGTGG AAAGCAAAGA CAACGGCAAG

EG327 CACAGTCGAG TTCTTGAGCG CAGATACGAA AACAACGACT GTTAATGTGG AAAGCAAAGA CAACGGCAAG

Consensus CACAGTCGAG TTCTTGAGCG CAGATACGAA AACAACGACT GTTAATGTGG AAAGCAAAGA CAACGGCAAG

C5



H15
BZ10
BZ198

P20

H38
Z2491

H41
EG329
PMC21
EG327
Consensus



841 

AAAACCGAAG 
AGAACCGAAG
AAAACCGAAG 

AGAACCGAAG

AGAACCGAAG 
AGAACCGAAG 
AAAACCGAAG 
AAAACCGAAG 
AAAACCGAAG 
AGAACCGAAG 
A-AACCGAAG



TTAAAATCGG 
TTAAAATCGG 
TTAAAATCGG 
TTAAAATCGG 
TTAAAATCGG 
TTAAAATCGG 
TTAAAATCGG 
TTAAAATCGG 
TTAAAATCGG 
TTAAAATCGG 
TTAAAATCGG



TGCGAAGACT 
TGCGAAGACT 
TGCGAAGACT 
TGCGAAGACT 
TGCGAAGACT 
TGCGAAGACT 
TGCGAAGACT 
TGCGAAGACT 
TGCGAAGACT 
TGCGAAGACT 
TGCGAAGACT 



TCTGTTATTA 
TCTGTTATTA 
TCTGTTATTA 
TCTGTTATTA 
TCTGTTATTA 
TCTGTTATTA 
TCTGTTATTA 
TCTGTTATTA 
TCTGTTATTA 
TCTGTTATCA 

TCTGTTAT-A 



C5 



AAGAAAAAGA 
AAGAAAAAGA 
AAGAAAAAGA 
AAGAAAAAGA 
AAGAAAAAGA 
AAGAAAAAGA 
AAGAAAAAGA 
AAGAAAAAGA 
AAGAAAAAGA 
AAGAAAAAGA 
AAGAAAAAGA 



CGGTAAGTTG 
CGGTAAGTTG 
CGGTAAGTTG 
CGGTAAGTTG 
CGGTAAGTTG 
CGGTAAGTTG 
CGGTAAGTTG 
CGGTAAGTTG 
CGGTAAGTTG 
CGGTAAGTTG 
CGGTAAGTTG 



910 

GTTACTGGTA 
GTTACTGGTA 
GTTACTGGTA 
GTTACTGGTA 
GTTACTGGTA 
GTTACTGGTA 
GTTACTGGTA 
GTTACTGGTA 
GTTACTGGTA 
GTTACTGGTA 
GTTACTGGTA 



H15
BZ10
BZ198

P20

H38
Z2491

H41
EG329
PMC21
EG327
Consensus



911 

AAGGCAAAGA 
AAGGCAAAGG 
AAGGCAAAGA 
AAGGCAAAGG 
AAGGCAAAGG 
AAGGCAAAGG 
AAGGCAAAGG 
AAGACAAAGG 
AAGACAAAGG 
AAGACAAAGG 
AAG-CAAAG- 



CGAGAATGGT 
CGAGAATGGT 
CGAGAATGGT 
CGAGAATGGT 
CGAGAATGGT 
CGAGAATGGT 
CGAGAATGGT 
CGAGAATGGT 
CGAGAATGGT 
CGAGAATGAT 
CGAGAATG-T



TCTTCTACAG 
TCTTCTACAG 
TCTTCTACAG 
TCTTCTACAG 
TCTTCTACAG 
TCTTCTACAG 
TCTTCTACAG 
TCTTCTACAG 
TCTTCTACAG 
TCTTCTACAG 
TCTTCTACAG 



ACGAAGGCGA 
ACGAAGGCGA 
ACGAAGGCGA 
ACGAAGGCGA 
ACGAAGGCGA 
ACGAAGGCGA 
ACGAAGGCGA 
ACGAAGGCGA 
ACGAAGGCGA 
ACAAAGGCGA 
AC-AAGGCGA



AGGCTTAGTG 
AGGCTTAGTG 
AGGCTTAGTG 
AGGCTTAGTG 
AGGCTTAGTG 
AGGCTTAGTG 
AGGCTTAGTG 
AGGCTTAGTG 
AGGCTTAGTG 
AGGCTTAGTG 
AGGCTTAGTG



ACTGCAAAAG 
ACTGCAAAAG 
ACTGCAAAAG 
ACTGCAAAAG 
ACTGCAAAAG 
ACTGCAAAAG 
ACTGCAAAAG 
ACTGCAAAAG 
ACTGCAAAAG 
ACTGCAAAAG 
ACTGCAAAAG



960 

AAGTGATTGA 
AAGTGATTGA 
AAGTGATTGA 
AAGTGATTGA 
AAGTGATTGA 
AAGTGATTGA 
AAGTGATTGA 
AAGTGATTGA 
AAGTGATTGA 
AAGTGATTGA 
AAGTGATTGA 



FIG. 2cont 



H15
BZ10
BZ198
P20
H38
Z2491
H41
EG329
PMC21
EG327
Consensus



TGCAGTAAAC 
TGCAGTAAAC 
TGCAGTAAAC 
TGCAGTAAAC 
TGCAGTAAAC 
TGCAGTAAAC 
TGCAGTAAAC 
TGCAGTAAAC 
TGCAGTAAAC 
TGCAGTAAAC 
TGCAGTAAAC 



AAGGCTGGTT 
AAGGCTGGTT 
AAGGCTGGTT 
AAGGCTGGTT 
AAGGCTGGTT 
AAGGCTGGTT 
AAGGCTGGTT 
AAGGCTGGTT 
AAGGCTGGTT 
AAGGCTGGTT 
AAGGCTGGTT



GGAGAATGAA AACAACAACC 
GGAGAATGAA AACAACAACC 
GGAGAATGAA AACAACAACC 
GGAGAATGAA AACAACAACC 
GGAGAATGAA AACAACAACC 
GGAGAATGAA AACAACAACC 
GGAGAATGAA AACAACAACC 
GGAGAATGAA AACAACAACC
GGAGAATGAA AACAACAACC 
GGAGAATGAA AACAACAACC 
GGAGAATGAA AACAACAACC 



C5 



GCTAATGGTC AAACAGGTCA 
GCTAATGGTC AAACAGGTCA 
GCTAATGGTC AAACAGGTCA 
GCTAATGGTC AAACAGGTCA 
GCTAATGGTC AAACAGGTCA
GCTAATGGTC AAACAGGTCA
GCTAATGGTC AAACAGGTCA
GCTAATGGTC AAACAGGTCA
GCTAATGGTC AAACAGGTCA
GCTAATGGTC AAACAGGTCA
GCTAATGGTC AAACAGGTCA



1050 
AGCTGACAAG 
AGCTGACAAG 
AGCTGACAAG 
AGCTGACAAG 
AGCTGACAAG 
AGCTGACAAG 
AGCTGACAAG 
AGCTGACAAG 
AGCTGACAAG 
AGCTGACAAG 
AGCTGACAAG 



1051 

H15 TTTGAAACCG TTACATCAGG CACAAAAGTA

BZ10 TTTGAAACCG TTACATCAGG CACAAAAGTA

BZ198 TTTGAAACCG TTACATCAGG CACAAATGTA

P20 TTTGAAACCG TTACATCAGG CACAAAAGTA

H38 TTTGAAACCG TTACATCAGG CACAAATGTA

Z2491 TTTGAAACCG TTACATCAGG CACAAATGTA

H41 TTTGAAACCG TTACATCAGG CACAAAAGTA

EG329 TTTGAAACCG TTACATCAGG CACAAATGTA

PMC21 TTTGAAACCG TTACATCAGG CACAAATGTA

EG327 TTTGAAACCG TTACATCAGG CACAAATGTA

Consensus TTTGAAACCG TTACATCAGG CACAAA-GTA



1120 

ACCTTTGCTA GTGGTAATGG TACAACTGCG ACTGTAAGTA
ACCTTTGCTA GTGGTAATGG TACAACTGCG ACTGTAAGTA
ACCTTTGCTA GTGGTAAAGG TACAACTGCG ACTGTAAGTA
ACCTTTGCTA GTGGTAATGG TACAACTGCG ACTGTAAGTA
ACCTTTGCTA GTGGTAAAGG TACAACTGCG ACTGTAAGTA
ACCTTTGCTA GTGGTAAAGG TACAACTGCG ACTGTAAGTA
ACCTTTGCTA GTGGTAATGG TACAACTGCG ACTGTAAGTA
ACCTTTGCTA GTGGTAAAGG TACAACTGCG ACTGTAAGTA
ACCTTTGCTA GTGGTAAAGG TACAACTGCG ACTGTAAGTA
ACCTTTGCTA GTGGTAAAGG TACAACTGCG ACTGTAAGTA
ACCTTTGCTA GTGGTAA-GG TACAACTGCG ACTGTAAGTA



C5 



H15
BZ10
BZ198

P20

H38
Z2491

H41
EG329
PMC21
EG327
Consensus



1121 

AAGATGATCA

AAGATGATCA 
AAGATGATCA 
AAGATGATCA 
AAGATGATCA 
AAGATGATCA 
AAGATGATCA 
AAGATGATCA 
AAGATGATCA 
AAGATGATCA 
AAGATGATCA 



AGGCAACATC ACTGTTAAGT

AGGCAACATC ACTGTTAAGT 
AGGCAACATC ACTGTTAAGT 
AGGCAACATC ACTGTTAAGT 
AGGCAACATC ACTGTTAAGT 
AGGCAACATC ACTGTTATGT 
AGGCAACATC ACTGTTAAGT 
AGGCAACATC ACTGTTATGT 
AGGCAACATC ACTGTTATGT 
AGGCAACATC ACTGTTATGT 
AGGCAACATC ACTGTTA-GT



1190 

ATGATGTAAA TGTCGGCGAT GCCCTAAACG TCAATCAGCT 
ATGATGTAAA TGTCGGCGAT GCCCTAAACG TCAATCAGCT 
ATGATGTAAA TGTCGGCGAT GCCCTAAACG TCAATCAGCT 
ATGATGTAAA TGTCGGCGAT GCCCTAAACG TCAATCAGCT 
ATGATGTAAA TGTCGGCGAT GCCCTAAACG TCAATCAGCT 
ATGATGTAAA TGTCGGCGAT GCCCTAAACG TCAATCAGCT 
ATGATGTAAA TGTCGGCGAT GCCCTAAACG TCAATCAGCT 
ATGATGTAAA TGTCGGCGAT GCCCTAAACG TCAATCAGCT 
ATGATGTAAA TGTCGGCGAT GCCCTAAACG TCAATCAGCT 
ATGATGTAAA TGTCGGCGAT GCCCTAAACG TCAATCAGCT 
ATGATGTAAA TGTCGGCGAT GCCCTAAACG TCAATCAGCT



C5 



H15
BZ10
BZ198
P20
H38
Z2491
H41
EG329
PMC21
EG327
Consensus



1191 

GCAAAACAGC 
GCAAAACAGC 
GCAAAACAGC 
GCAAAACAGC 
GCAAAACAGC 
GCAAAACAGC 
GCAAAACAGC 
GCAAAACAGC 
GCAAAACAGC 
GCAAAACAGC 
GCAAAACAGC 



GGTTGGAATT 
GGTTGGAATT 
GGTTGGAATT 
GGTTGGAATT 
GGTTGGAATT 
GGTTGGAATT 
GGTTGGAATT 
GGTTGGAATT 
GGTTGGAATT 
GGTTGGAATT 
GGTTGGAATT 



TGGATTCCAA 
TGGATTCCAA 
TGGATTCCAA 
TGGATTCCAA 
TGGATTCCAA 
TGGATTCCAA 
TGGATTCCAA 
TGGATTCCAA 
TGGATTCCAA 
TGGATTCCAA 
TGGATTCCAA



AGCGGTTGCA 
AGCGGTTGCA 
AGCGGTTGCA 
AGCGGTTGCA 
AGCGGTTGCA 
AGCGGTTGCA 
AGCGGTTGCA 
AGCGGTTGCA 
AGCGGTTGCA 
AGCGGTTGCA 
AGCGGTTGCA



GGTTCTTCGG 
GGTTCTTCGG 
GGTTCTTCGG 
GGTTCTTCGG 
GGTTCTTCGG 
GGTTCTTCGG 
GGTTCTTCGG 
GGTTCTTCGG 
GGTTCTTCGG 
GGTTCTTCGG 
GGTTCTTCGG 



C5 



GCAAAGTCAT 
GCAAAGTCAT 
GCAAAGTCAT 
GCAAAGTCAT 
GCAAAGTCAT 
GCAAAGTCAT 
GCAAAGTCAT 
GCAAAGTCAT 
GCAAAGTCAT 
GCAAAGTCAT 
GCAAAGTCAT 



1260 
CAGCGGCAAT 
CAGCGGCAAT 
CAGCGGCAAT 
CAGCGGCAAT 
CAGCGGCAAT 
CAGCGGCAAT 
CAGCGGCAAT 
CAGCGGCAAT 
CAGCGGCAAT 
CAGCGGCAAT 
CAGCGGCAAT 



1261 1330

H15 GTTTCGCCGA GCAAGGGAAA GATGGATGAA ACCGTCAACA TTAATGCCGG CAACAACATC GAGATTACCC
BZ10 GTTTCGCCGA GCAAGGGAAA GATGGATGAA ACCGTCAACA TTAATGCCGG CAACAACATC GAGATTACCC
BZ198 GTTTCGCCGA GCAAGGGAAA GATGGATGAA ACCGTCAACA TTAATGCCGG CAACAACATC GAGATTACCC
P20 GTTTCGCCGA GCAAGGGAAA GATGGATGAA ACCGTCAACA TTAATGCCGG CAACAACATC GAGATTACCC
H38 GTTTCGCCGA GCAAGGGAAA GATGGATGAA ACCGTCAACA TTAATGCCGG CAACAACATC GAGATTACCC
Z2491 GTTTCGCCGA GCAAGGGAAA GATGGATGAA ACCGTCAACA TTAATGCCGG CAACAACATC GAGATTAGCC
H41 GTTTCGCCGA GCAAGGGAAA GATGGATGAA ACCGTCAACA TTAATGCCGG CAACAACATC GAGATTACCC
EG329 GTTTCGCCGA GCAAGGGAAA GATGGATGAA ACCGTCAACA TTAATGCCGG CAACAACATC GAGATTACCC
PMC21 GTTTCGCCGA GCAAGGGAAA GATGGATGAA ACCGTCAACA TTAATGCCGG CAACAACATC GAGATTACCC
EG327 GTTTCGCCGA GCAAGGGAAA GATGGATGAA ACCGTCAACA TTAATGCCGG CAACAACATC GAGATTACCC
Consensus GTTTCGCCGA GCAAGGGAAA GATGGATGAA ACCGTCAACA TTAATGCCGG CAACAACATC GAGATTA-CC



C5 



FIG. 2cont 



H15
BZ10
BZ198

P20

H38
Z2491

H41
EG329
PMC21
EG327
Consensus



1331 

GCAACGGCAA 
GCAACGGCAA 
GCAACGGTAA 
GCAACGGCAA 
GCAACGGTAA 
GCAACGGTAA 
GCAACGGCAA 
GCAACGGTAA 
GCAACGGTAA 
GCAACGGCAA 
GCAACGG-AA 



AAATATCGAC 
AAATATCGAC 
AAATATCGAC 
AAATATCGAC 
AAATATCGAC 
AAATATCGAC 
AAATATCGAC 

AAATATCGAC 
AAATATCGAC 
AAATATCGAC



ATCGCCACTT
ATCGCCACTT 
ATCGCCACTT 
ATCGCCACTT 
ATCGCCACTT 
ATCGCCACTT 
ATCGCCACTT 
ATCGCCACTT 
ATCGCCACTT 
ATCGCCACTT 
ATCGCCACTT



CGATGACCCC 
CGATGACCCC 
CGATGGCGCC 
CGATGACCCC 
CGATGACCCC 
CGATGGCGCC 
CGATGACCCC 
CGATGACCCC 
CGATGACCCC 
CGATGACCCC 
CGATG-C-CC 



C5 



GCAATTTTCC 
GCAATTTTCC 
GCAGTTTTCC 
GCAATTTTCC 
GCAGTTTTCC 
GCAGTTTTCC 
GCAATTTTCC 
GCAGTTTTCC 
GCAGTTTTCC 
GCAATTTTCC 



AGCGTTTCGC 
AGCGTTTCGC 
AGCGTTTCGC 
AGCGTTTCGC 
AGCGTTTCGC 
AGCGTTTCGC 
AGCGTTTCGC 
AGCGTTTCGC 
AGCGTTTCGC 
AGCGTTTCGC 
AGCGTTTCGC



1400 
TCGGCGCGGG 
TCGGCGCGGG 
TCGGTGCGGG 
TCGGCGCGGG 
TCGGCGCGGG 
TCGGCGCGGG 
TCGGCGCGGG 
TCGGCGCGGG 
TCGGCGCGGG 
TCGGCGCGGG 



1401 1470 

H15 GGCGGATGCG CCCACTTTAA GCGTGGATGA CGAGGGCGCG TTGAATGTCG GCAGCAAGGA TGCCAACAAA

BZ10 GGCGGATGCG CCCACTTTAA GCGTGGATGA CGAGGGCGCG TTGAATGTCG GCAGCAAGGA TGCCAACAAA

BZ198 GGCGGATGCG CCCACTTTGA GCGTGGATGA CGAGGGCGCG TTGAATGTCG GCAGCAAGGA TACCAACAAA

P20 GGCGGATGCG CCCACTTTAA GCGTGGATGA CGAGGGCGCG TTGAATGTCG GCAGCAAGGA TGCCAACAAA

H38 GGCGGATGCG CCCACTTTGA GCGTGGATGA CAAGGGCGCG TTGAATGTCG GCAGCAAGGA TGCCAACAAA

Z2491 GGCAGATGCG CCCACTTTAA GCGTGGATGA CGAGGGCGCG TTGAATGTCG GCAGCAAGGA TGCCAACAAA

H41 GGCGGATGCG CCCACTTTAA GCGTGGATGA CGAGGGCGCG TTGAATGTCG GCAGCAAGGA TGCCAACAAA

EG329 GGCGGATGCG CCCACTTTGA GCGTGGAT.. .GGGGACGCA TTGAATGTCG GCAGCAAGAA GGACAACAAA

PMC21 GGCGGATGCG CCCACTTTGA GCGTGGAT.. .GGGGACGCA TTGAATGTCG GCAGCAAGAA GGACAACAAA

EG327 GGCGGATGCG CCCACTTTAA GCGTGGATGA CGAGGGCGCG TTGAATGTCG GCAGCAAGGA TGCCAACAAA

Consensus Ggc-gatccg cgcagttT-a gcgtg gat GG-C6C- TTGAATGTCG GCAGCAAGGA CAACAAA 



C5 



H15
BZ10
BZ198

P20

H38
Z2491

H41
EG329
PMC21
EG327
Consensus



1471 

CCCGTCCGCA 
CCCGTCCGCA 
CCCGTCCGCA 
CCCGTCCGCA 
CCCGTCCGCA 
CCCGTCCGCA 
CCCGTCCGCA 
CCCGTCCGCA 
CCCGTCCGCA 
CCCGTCCGCA 
CCCGTCCGCA



TTACCAATGT 
TTACCAATGT 
TTACCAATGT 
TTACCAATGT 
TTACCAATGT 
TTACCAATGT 
TTACCAATGT 
TTACCAATGT 
TTACCAATGT 
TTACCAATGT 
TTACCAATGT



CGCCCCGGGC 
CGCCCCGGGC 
CGCCCCGGGC 
CGCCCCGGGC 
CGCCCCGGGC 
CGCCCCGGGC 
CGCCCCGGGC 
CGCCCCGGGC 
CGCCCCGGGC 
CGCCCCGGGC 
CGCCCCGGGC 



GTTAAAGAGG 

GTTAAAGAGG
GTTAAAGAGG
GTTAAAGAGG
GTTAAAGAGG
GTTAAAGAGG
GTTAAAGAGG

GTTAAAGAGG 
GTTAAAGAGG 
GTTAAAGAGG 
GTTAAAGAGG



GGGATGTTAC 
GGGATGTTAC 
GGGATGTTAC 
GGGATGTTAC 
GGGATGTTAC 
GGGATGTTAC 
GGGATGTTAC 
GGGATGTTAC 
GGGATGTTAC 
GGGATGTTAC 
GGGATGTTAC 



C5 



AAACGTCGCA 
AAACGTCGCA 
AAACGTCGCA 
AAACGTCGCA 
AAACGTCGCA 
AAACGTCGCA 
AAACGTCGCG 
AAACGTCGCA 
AAACGTCGCA 
AAACGTCGCA 
AAACGTCGC-



1540
CAACTTAAAG 
CAACTTAAAG 
CAACTTAAAG 
CAACTTAAAG 
CAACTTAAAG 
CAACTTAAAG 
CAACTTAAAG 
CAACTTAAAG 
CAACTTAAAG 
CAACTTAAAG 
CAACTTAAAG



H15
BZ10
BZ198

P20

H38
Z2491

H41
EG329
PMC21
EG327
Consensus



1541 

GTGTGGCGCA 
GTGTGGCGCA 
GCGTGGCGCA 
GTGTGGCGCA 
GCGTGGCGCA 
GCGTGGCGCA 
GTGTGGCGCA 
GCGTGGCGCA 
GCGTGGCGCA 
GCGTGGCGCA 
G-GTGGCGCA 



AAACTTGAAC 
AAACTTGAAC 
AAACTTGAAC 
AAACTTGAAC 
AAACTTGAAC 
AAACTTGAAC 
AAACTTGAAC 
AAACTTGAAC 
AAACTTGAAC 
AAACTTGAAC 
AAACTTGAAC



AACCGCATCG 
AACCGCATCG 
AACCGCATCG 
AACCGCATCG 
AACCGCATCG 
AACCGCATCG 
AACCGCATCG 
AACCGCATCG 
AACCGCATCG 
AACCACATCG 



ACAATGTGGA 
ACAATGTGGA 
ACAATGTGGA 
ACAATGTGAA 
ACAATGTGGA 
ACAATGTGGA 
ACAATGTGAA 
ACAATGTGGA 
ACAATGTGGA 
ACAATGTGGA 



CGGCAACGCG 
CGGCAACGCG 
CGGCAACGCG 
CGGCAACGCG 
CGGCAACGCG 
CGGCAACGCG 
CGGCAACGCG 
CGGCAACGCG 
CGGCAACGCG 
CGGCAACGCG 
CGGCAACGCG



CGCGCGGGTA 
CGCGCGGGTA 
CGTGCGGGCA
CGCGCGGGTA 
CGTGCGGGCA 
CGTGCGGGCA 
CGTGCGGGCA 
CGTGCGGGCA 
CGTGCGGGCA 
CGTGCGGGCA 
CG-GCGGG-A 



1610 
TCGCCCAAGC 
TCGCCCAAGC 
TCGCCCAAGC 
TCGCCCAAGC 
TCGCCCAAGC 
TCGCCCAAGC 
TCGCCCAAGC 
TCGCCCAAGC 
TCGCCCAAGC 
TCGCCCAAGC 



H15
BZ10
BZ198

P20

H38
Z2491

H41
EG329
PMC21
EG327
Consensus



1611 

GATTGCAACC 
GATTGCAACC 
GATTGCAACC 
GATTGCAACC 
GATTGCAACC 
GATTGCAACC 
GATTGCAACC 
GATTGCAACC 
GATTGCAACC 
GATTGCAACC 
GATTGCAACC 



GCAGGTTTGG 
GCAGGTTTGG 
GCAGGTCTAG 
GCAGGTTTGG 
GCAGGTCTGG 
GCAGGTCTGG 
GCAGGTCTGG 
GCAGGTCTGG 
GCAGGTCTGG 
GCAGGTCTGG 
GCAGGT-T-G 



CTCAGGCGTA 
CTCAGGCCTA 
TTCAGGCGTA 
CTCAGGCCTA 
TTCAGGCGTA 
TTCAGGCGTA 
TTCAGGCGTA 
TTCAGGCGTA 
TTCAGGCGTA 
TTCAGGCGTA 
-TCAGGC-TA 



TTTGCCCGGC 
TTTGCCCGGC 
TCTGCCCGGC 
TTTGCCCGGC 
TCTGCCCGGC 
TCTGCCCGGC 
TCTGCCCGGC 
TTTGCCCGGC 
TTTGCCCGGC 
TCTGCCCGGC 
T-TGCCCGGC



C5 



AAGAGTATGA 
AAGAGTATGA 
AAGAGTATGA 
AAGAGTATGA 
AAGAGTATGA 
AAGAGTATGA 
AAGAGTATGA 
AAGAGTATGA 
AAGAGTATGA 
AAGAGTATGA 



TGGCGATCGG 
TGGCGATCGG 
TGGCGATCGG 
TGGCGATCGG 
TGGCGATCGG 
TGGCGATCGG 
TGGCGATCGG 
TGGCGATCGG 
TGGCGATCGG 
TGGCGATCGG 
TGGCGATCGG



1680
CGGCGGTACT 
CGGCGGTACT 
CGGCGACACT 
CGGCGGTACT 
CGGCGGCACT 
CGGCGGCACT 
CGGCGGCACT 
CGGCGGCACT 
CGGCGGCACT 
CGGCGGCACT 
CGGCG--ACT



FIG. 2cont 



H15
BZ10
BZ198

P20

H38
Z2491

H41
EG329
PMC21
EG327
Consensus



1681 

TATCGCGGCG 
TATCGCGGCG 
TATCGCGGCG 
TATCTCGGCG 
TATCGCGGCG 
TATCGCGGCG 
TATCTCGGCG 
TATCGCGGCG 
TATCGCGGCG 
TATCGCGGCG 
TATC-CGGCG



AAGCCGGTTA 
AAGCCGGTTA 
AAGCCGGTTA 
AAGCCGGTTA 
AAGCCGGTTA 
AAGCCGGTTA 
AAGCCGGTTA 
AAGCCGGTTA 
AAGCCGGTTA 
AAGCCGGTTA 

AAGCCGGTTA



CGCCATCGGC 
CGCCATCGGC 
CGCCATCGGC 
CGCCATCGGC 
CGCCATCGGC 
CGCCATCGGC 
TGCCATCGGC 
CGCCATCGGC 
CGCCATCGGC 
TGCCATCGGC 
-GCCATCGGC



TACTCGAGCA 
TACTCGAGCA 
TACTCAAGTA 
TACTCGAGCA 
TACTCCAGTA 
TACTCCAGTA 
TACTCAAGCA 
TACTCCAGTA 
TACTCCAGTA 
TACTCAAGCA 
TACTC-AG-A 
C5 



TTTCTGACAC 
TTTCTGACAC 
TTTCCGACGG 
TTTCTGACAC 
TTTCCGACGG 
TTTCCGACGG 
TTTCCGCCGG 
TTTCCGACGG 
TTTCCGACGG 
TTTCCGACGG 
TTTC-G-C--



TGGGAATTGG 
TGGGAATTGG 
CGGAAATTGG 
TGGGAATTGG 
CGGAAATTGG 
CGGAAATTGG 
CGGAAATTGG 
CGGAAATTGG 
CGGAAATTGG 
CGGAAATTGG 
-GG-AATTGG



1750 
GTTATCAAGG 
GTTATCAAGG 
ATTATCAAAG 
GTTATCAAGG 
ATTATCAAAG 
ATTATCAAAG 
ATTATCAAAG 
ATTATCAAAG 
ATTATCAAAG 
ATTATCAAAG 
-TTATCAA-G 



H15
BZ10
BZ198

P20

H38
Z2491

H41
EG329
PMC21
EG327
Consensus



1751 

GCACGGCTTC 
GCACGGCTTC 
GCACGGCTTC 
GCACGGCTTC 
GCACGGCTTC 
GCACGGCTTC 
GCACGGCTTC 
GCACGGCTTC 
GCACGGCTTC 
GCACGGCTTC 
GCACGGCTTC 



CGGCAATTCG 
CGGCAATTCG 
CGGCAATTCG 
CGGCAATTCG 
CGGCAATTCG 
CGGCAATTCG 
CGGCAATTCG 
CGGCAATTCG 
CGGCAATTCG 
CGGCAATTCG 
CGGCAATTCG



CGCGGCCATT 
CGCGGTCATT 
CGCGGCCATT 
CGCGGTCATT 
CGCGGTCATT 
CGCGGCCATT 
CGCGGCCATT 
CGCGGCCATT 
CGCGGCCATT 
CGCGGCCATT 
CGCGG-CATT 



TCGGTGCTTC 
TCGGTACTTC 
TCGGTGCTTC 
TCGGTACTTC 
TCGGTGCTTC 
TCGGTGCTTC 
TCGGTGCTTC 
TCGGTGCTTC 
TCGGTGCTTC 
TCGGTGCTTC 
TCGGT-CTTC



CGCATCTGTC 
CGCATCTGTC 
CGCATCTGTC 
CGCATCTGTC 
CGCATCTGTC 
CGCATCTGTC 
CGCATCTGTC 
CGCATCTGTC 
CGCATCTGTC 
CGCATCTGTC 
CGCATCTGTC 



GGTTATCAGT 
GGTTATCAGT 
GGTTATCAAT 
GGTTATCAGT 
GGTTATCAGT 
GGTTATCAGT 
GGTTATCAGT 
GGTTATCAGT 
GGTTATCAGT 
GGTTATCAGT 
GGTTATCA-T 



1815 
GGTAA 
GGTAA 
GGTAA 
GGTAA 
GGTAA 
GGTAA 
GGTAA 
GGTAA 
GGTAA 
GGTAA 
GGTAA 



C5 
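
The Consensus rows of the alignment above appear to keep a residue only where every strain agrees and to print "-" wherever the strains differ or a gap is present. The figure does not state this rule, so the following Python sketch is an assumption inferred from the rows themselves; the function name `consensus` and the gap characters are illustrative only.

```python
def consensus(rows, gap_chars=".-"):
    """Column-wise consensus of equal-length aligned rows.

    Assumed rule (not stated in the figure): keep the residue when all
    rows agree, otherwise emit '-'. Gap characters never count as agreement.
    """
    width = len(rows[0])
    if any(len(row) != width for row in rows):
        raise ValueError("aligned rows must all have the same length")
    gaps = set(gap_chars)
    out = []
    for column in zip(*rows):
        residues = {c.upper() for c in column}
        if len(residues) == 1 and not (residues & gaps):
            out.append(residues.pop())
        else:
            out.append("-")
    return "".join(out)


# Three of the rows from the column ending at position 1810
print(consensus(["GGTTATCAGT", "GGTTATCAGT", "GGTTATCAAT"]))
```

Applied to those three rows, the sketch marks the single position at which they differ.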
FIG. 3A




FIG. 3B 





CO 





FIG. 4B 




FIG. 4C 



1 MNKIYRIIWN SALNAWVVVS ELTRNHTKRA SATVKTAVLA TLLFATVQAS

51 ANNETDLTSV GTEKLSFSAN GNKVNITSDT KGLNFAKETA GTNGDTTVHL

101 NGIGSTLTDT LLNTGATTNV TNDNVTDDEK KRAASVKDVL NAGWNIKGVK

151 PGTTASDNVD FVRTYDTVEF LSADTKTTTV NVESKDNGKK TEVKIGAKTS

201 VIKEKDGKLV TGKDKGENGS STDEGEGLVT AKEVIDAVNK AGWRMKTTTA

251 NGQTGQADKF ETVTSGTNVT FASGKGTTAT VSKDDQGNIT VMYDVNVGDA

301 LNVNQLQNSG WNLDSKAVAG SSGKVISGNV SPSKGKMDET VNINAGNNIE

351 ITRNGKNIDI ATSMTPQFSS VSLGAGADAP TLSVDGDALN VGSKKDNKPV

401 RITNVAPGVK EGDVTNVAQL KGVAQNLNNR IDNVDGNARA GIAQAIATAG

451 LVQAYLPGKS MMAIGGGTYR GEAGYAIGYS SISDGGNWII KGTASGNSRG

501 HFGASASVGY QW*



1 ATGAACAAAA TATACCGCAT CATTTGGAAT AGTGCCCTCA ATGCATGGGT 

51 CGTCGTATCC GAGCTCACAC GCAACCACAC CAAACGCGCC TCCGCAACCG 

101 TGAAGACCGC CGTATTGGCG ACTCTGTTGT TTGCAACGGT TCAGGCAAGT

151 GCTAACAATG AAACAGATCT GACCAGTGTT GGAACTGAAA AATTATCGTT

201 TAGCGCAAAC GGCAATAAAG TCAACATCAC AAGCGACACC AAAGGCTTGA 

251 ATTTTGCGAA AGAAACGGCT GGGACGAACG GCGACACCAC GGTTCATCTG 

301 AACGGTATTG GTTCGACTTT GACCGATACG CTGCTGAATA CCGGAGCGAC

351 CACAAACGTA ACCAACGACA ACGTTACCGA TGACGAGAAA AAACGTGCGG 

401 CAAGCGTTAA AGACGTATTA AACGCTGGCT GGAACATTAA AGGCGTTAAA 

451 CCCGGTACAA CAGCTTCCGA TAACGTTGAT TTCGTCCGCA CTTACGACAC

501 AGTCGAGTTC TTGAGCGCAG ATACGAAAAC AACGACTGTT AATGTGGAAA 

551 GCAAAGACAA CGGCAAGAAA ACCGAAGTTA AAATCGGTGC GAAGACTTCT

601 GTTATTAAAG AAAAAGACGG TAAGTTGGTT ACTGGTAAAG ACAAAGGCGA 

651 GAATGGTTCT TCTACAGACG AAGGCGAAGG CTTAGTGACT GCAAAAGAAG 

701 TGATTGATGC AGTAAACAAG GCTGGTTGGA GAATGAAAAC AACAACCGCT 

751 AATGGTCAAA CAGGTCAAGC TGACAAGTTT GAAACCGTTA CATCAGGCAC

801 AAATGTAACC TTTGCTAGTG GTAAAGGTAC AACTGCGACT GTAAGTAAAG

851 ATGATCAAGG CAACATCACT GTTATGTATG ATGTAAATGT CGGCGATGCC 

901 CTAAACGTCA ATCAGCTGCA AAACAGCGGT TGGAATTTGG ATTCCAAAGC 

951 GGTTGCAGGT TCTTCGGGCA AAGTCATCAG CGGCAATGTT TCGCCGAGCA

1001 AGGGAAAGAT GGATGAAACC GTCAACATTA ATGCCGGCAA CAACATCGAG

1051 ATTACCCGCA ACGGTAAAAA TATCGACATC GCCACTTCGA TGACCCCGCA 

1101 GTTTTCCAGC GTTTCGCTCG GCGCGGGGGC GGATGCGCCC ACTTTGAGCG 

1151 TGGATGGGGA CGCATTGAAT GTCGGCAGCA AGAAGGACAA CAAACCCGTC 

1201 CGCATTACCA ATGTCGCCCC GGGCGTTAAA GAGGGGGATG TTACAAACGT 

1251 CGCACAACTT AAAGGCGTGG CGCAAAACTT GAACAACCGC ATCGACAATG 

1301 TGGACGGCAA CGCGCGTGCG GGCATCGCCC AAGCGATTGC AACCGCAGGT 

1351 CTGGTTCAGG CGTATTTGCC CGGCAAGAGT ATGATGGCGA TCGGCGGCGG

1401 CACTTATCGC GGCGAAGCCG GTTACGCCAT CGGCTACTCC AGTATTTCCG 

1451 ACGGCGGAAA TTGGATTATC AAAGGCACGG CTTCCGGCAA TTCGCGCGGC 

1501 CATTTCGGTG CTTCCGCATC TGTCGGTTAT CAGTGGTAA



FIG. 5 
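
The protein listing in FIG. 5 can be cross-checked against the nucleotide listing printed beneath it by translating the DNA in frame from position 1. The Python sketch below assumes the standard genetic code and ignores the spaces that separate the 10-letter groups; the names `CODON_TABLE` and `translate` are illustrative and do not come from the figure.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumed to apply here)
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string in frame 1; whitespace is ignored and unknown codons become 'X'."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        protein.append(CODON_TABLE.get(seq[i:i + 3], "X"))
    return "".join(protein)


# First three 10-letter groups of the FIG. 5 nucleotide listing
print(translate("ATGAACAAAA TATACCGCAT CATTTGGAAT"))
```

The printed residues correspond to the first group of the FIG. 5 protein listing, and the same check can be repeated line by line to spot misread characters in either listing.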



1 MNKIYRIIWN SALNAWVAVS ELTRNHTKRA SATVKTAVLA TLLFATVQAN

51 ATDETGLINV ETEKLSFGAN GKKVNIISDT KGLNFAKETA GTNGDTTVHL

101 NGIGSTLTDM LLNTGATTNV TNDNVTDDEK KRAASVKDVL NAGWNIKGVK

151 PGTTASDNVD FVRTYDTVEF LSADTKTTTV NVESKDNGKK TEVKIGAKTS

201 VIKEKDGKLV TGKGKGENGS STDEGEGLVT AKEVIDAVNK AGWRMKTTTA

251 NGQTGQADKF ETVTSGTKVT FASGNGTTAT VSKDDQGNIT VKYDVNVGDA

301 LNVNQLQNSG WNLDSKAVAG SSGKVISGNV SPSKGKMDET VNINAGNNIE

351 ITRNGKNIDI ATSMTPQFSS VSLGAGADAP TLSVDDEGAL NVGSKDANKP

401 VRITNVAPGV KEGDVTNVAQ LKGVAQNLNN RIDNVNGNAR AGIAQAIATA

451 GLVQAYLPGK SMMAIGGGTY LGEAGYAIGY SSISAGGNWI IKGTASGNSR

501 GHFGASASVG YQW*



1 ATGAACAAAA TATACCGCAT CATTTGGAAT AGTGCCCTCA ATGCCTGGGT 

51 CGCCGTATCC GAGCTCACAC GGAACCACAC CAAACGCGCC TCCGCAACCG 

101 TGAAGACCGC CGTATTGGCG ACACTGTTGT TTGCAACGGT TCAGGCGAAT

151 GCTACCGATG AAACAGGCCT GATCAATGTT GAAACTGAAA AATTATCGTT 

201 TGGCGCAAAC GGCAAGAAAG TCAACATCAT AAGCGACACC AAAGGCTTGA 

251 ATTTCGCGAA AGAAACGGCT GGGACGAACG GCGACACCAC GGTTCATCTG 

301 AACGGTATCG GTTCGACTTT GACCGATATG CTGCTGAATA CCGGAGCGAC 

351 CACAAACGTA ACCAACGACA ACGTTACCGA TGACGAGAAA AAACGTGCGG 

401 CAAGCGTTAA AGACGTATTA AACGCAGGCT GGAACATTAA AGGCGTTAAA 

451 CCCGGTACAA CAGCTTCCGA TAACGTTGAT TTCGTCCGCA CTTACGACAC 

501 AGTCGAGTTC TTGAGCGCAG ATACGAAAAC AACGACTGTT AATGTGGAAA 

551 GCAAAGACAA CGGCAAGAAA ACCGAAGTTA AAATCGGTGC GAAGACTTCT 

601 GTTATTAAAG AAAAAGACGG TAAGTTGGTT ACTGGTAAAG GCAAAGGCGA

651 GAATGGTTCT TCTACAGACG AAGGCGAAGG CTTAGTGACT GCAAAAGAAG 

701 TGATTGATGC AGTAAACAAG GCTGGTTGGA GAATGAAAAC AACAACCGCT 

751 AATGGTCAAA CAGGTCAAGC TGACAAGTTT GAAACCGTTA CATCAGGCAC

801 AAAAGTAACC TTTGCTAGTG GTAATGGTAC AACTGCGACT GTAAGTAAAG 

851 ATGATCAAGG CAACATCACT GTTAAGTATG ATGTAAATGT CGGCGATGCC 

901 CTAAACGTCA ATCAGCTGCA AAACAGCGGT TGGAATTTGG ATTCCAAAGC 

951 GGTTGCAGGT TCTTCGGGCA AAGTCATCAG CGGCAATGTT TCGCCGAGCA

1001 AGGGAAAGAT GGATGAAACC GTCAACATTA ATGCCGGCAA CAACATCGAG 

1051 ATTACCCGCA ACGGCAAAAA TATCGACATC GCCACTTCGA TGACCCCGCA

1101 ATTTTCCAGC GTTTCGCTCG GCGCGGGGGC GGATGCGCCC ACTTTAAGCG

1151 TGGATGACGA GGGCGCGTTG AATGTCGGCA GCAAGGATGC CAACAAACCC

1201 GTCCGCATTA CCAATGTCGC CCCGGGCGTT AAAGAGGGGG ATGTTACAAA 

1251 CGTCGCGCAA CTTAAAGGTG TGGCGCAAAA CTTGAACAAC CGCATCGACA 

1301 ATGTGAACGG CAACGCGCGT GCGGGCATCG CCCAAGCGAT TGCAACCGCA 

1351 GGTCTGGTTC AGGCGTATCT GCCCGGCAAG AGTATGATGG CGATCGGCGG 

1401 CGGCACTTAT CTCGGCGAAG CCGGTTATGC CATCGGCTAC TCAAGCATTT 

1451 CCGCCGGCGG AAATTGGATT ATCAAAGGCA CGGCTTCCGG CAATTCGCGC 

1501 GGCCATTTCG GTGCTTCCGC ATCTGTCGGT TATCAGTGGT AA



FIG. 6 



1 MNKIYRIIWN SALNAWVVVS ELTRNHTKRA SATVKTAVLA TLLFATVQAS

51 ANNVDFVRTY DTVEFLSADT KTTTVNVESK DNGKKTEVKI GAKTSVIKEK

101 DGKLVTGKDK GENGSSTDEG EGLVTAKEVI DAVNKAGWRM KTTTANGQTG

151 QADKFETVTS GTNVTFASGK GTTATVSKDD QGNITVMYDV NVGDALNVNQ

201 LQNSGWNLDS KAVAGSSGKV ISGNVSPSKG KMDETVNINA GNNIEITRNG

251 KNIDIATSMT PQFSSVSLGA GADAPTLSVD GDALNVGSKK DNKPVRITNV

301 APGVKEGDVT NVAQLKGVAQ NLNNRIDNVD GNARAGIAQA IATAGLVQAY

351 LPGKSMMAIG GGTYRGEAGY AIGYSSISDG GNWIIKGTAS GNSRGHFGAS

401 ASVGYQW*



1 ATGAACAAAA TATACCGCAT CATTTGGAAT AGTGCCCTCA ATGCATGGGT

51 CGTCGTATCC GAGCTCACAC GCAACCACAC CAAACGCGCG TCCGCAACCG

101 TGAAGACCGC CGTATTGGCG ACTCTGTTGT TTGCAACGGT TCAGGCAAGT

151 GCTAACAACG TTGATTTCGT CCGCACTTAC GACACAGTCG AGTTCTTGAG

201 CGCAGATACG AAAACAACGA CTGTTAATGT GGAAAGCAAA GACAACGGCA

251 AGAAAACCGA AGTTAAAATC GGTGCGAAGA CTTCTGTTAT TAAAGAAAAA

301 GACGGTAAGT TGGTTACTGG TAAAGACAAA GGCGAGAATG GTTCTTCTAC

351 AGACGAAGGC GAAGGCTTAG TGACTGCAAA AGAAGTGATT GATGCAGTAA

401 ACAAGGCTGG TTGGAGAATG AAAACAACAA CCGCTAATGG TCAAACAGGT

451 CAAGCTGACA AGTTTGAAAC CGTTACATCA GGCACAAATG TAACCTTTGC

501 TAGTGGTAAA GGTACAACTG CGACTGTAAG TAAAGATGAT CAAGGCAACA

551 TCACTGTTAT GTATGATGTA AATGTCGGCG ATGCCCTAAA CGTCAATCAG

601 CTGCAAAACA GCGGTTGGAA TTTGGATTCC AAAGCGGTTG CAGGTTCTTC

651 GGGCAAAGTC ATCAGCGGCA ATGTTTCGCC GAGCAAGGGA AAGATGGATG

701 AAACCGTCAA CATTAATGCC GGCAACAACA TCGAGATTAC CCGCAACGGT

751 AAAAATATCG ACATCGCCAC TTCGATGACC CCGCAGTTTT CCAGCGTTTC

801 GCTCGGCGCG GGGGCGGATG CGCCCACTTT GAGCGTGGAT GGGGACGCAT

851 TGAATGTCGG CAGCAAGAAG GACAACAAAC CCGTCCGCAT TACCAATGTC

901 GCCCCGGGCG TTAAAGAGGG GGATGTTACA AACGTCGCAC AACTTAAAGG

951 GGTGGCGCAA AACTTGAACA ACCGCATCGA CAATGTGGAC GGCAACGCGC

1001 GTGCGGGCAT CGCCCAAGCG ATTGCAACCG CAGGTCTGGT TCAGGCGTAT

1051 TTGCCCGGCA AGAGTATGAT GGCGATCGGC GGCGGCACTT ATCGCGGCGA

1101 AGCCGGTTAC GCCATCGGCT ACTCCAGTAT TTCCGACGGC GGAAATTGGA

1151 TTATCAAAGG CACGGCTTCC GGCAATTCGC GCGGCCATTT CGGTGCTTCC

1201 GCATCTGTCG GTTATCAGTG GTAA



FIG. 7 



1 MNKIYRIIWN SALNAWVVVS ELTRNHTKRA SATVKTAVLA TLLFATVQAS 

51 ANRAASVKDV LNAGWNIKGV KPGTTASDNV DFVRTYDTVE FLSADTKTTT 

101 VNVESKDNGK KTEVKIGAKT SVIKEKDGKL VTGKDKGENG SSTDEGEGLV 

151 TAKEVIDAVN KAGWRMKTTT ANGQTGQADK FETVTSGTNV TFASGKGTTA 

201 TVSKDDQGNI TVMYDVNVGD ALNVNQLQNS GWNLDSKAVA GSSGKVISGN 

251 VSPSKGKMDE TVNINAGNNI EITRNGKNID IATSMTPQFS SVSLGAGADA 

301 PTLSVDGDAL NVGSKKDNKP VRITNVAPGV KEGDVTNVAQ LKGVAQNLNN 

351 RIDNVDGNAR AGIAQAIATA GLVQAYLPGK SMMAIGGGTY RGEAGYAIGY 

401 SSISDGGNWI IKGTASGNSR GHFGASASVG YQW* 



1 ATGAACAAAA TATACCGCAT CATTTGGAAT AGTGCCCTCA ATGCATGGGT 

51 CGTCGTATCC GAGCTCACAC GCAACCACAC CAAACGCGCC TCCGCAACCG 

101 TGAAGACCGC CGTATTGGCG ACTCTGTTGT TTGCAACGGT TCAGGCAAGT 

151 GCTAACCGTG CGGCAAGCGT TAAAGACGTA TTAAACGCTG GCTGGAACAT 

201 TAAAGGCGTT AAACCCGGTA CAACAGCTTC CGATAACGTT GATTTCGTCC 

251 GCACTTACGA CACAGTCGAG TTCTTGAGCG CAGATACGAA AACAACGACT 

301 GTTAATGTGG AAAGCAAAGA CAACGGCAAG AAAACCGAAG TTAAAATCGG 

351 TGCGAAGACT TCTGTTATTA AAGAAAAAGA CGGTAAGTTG GTTACTGGTA 

401 AAGACAAAGG CGAGAATGGT TCTTCTACAG ACGAAGGCGA AGGCTTAGTG 

451 ACTGCAAAAG AAGTGATTGA TGCAGTAAAC AAGGCTGGTT GGAGAATGAA 

501 AACAACAACC GCTAATGGTC AAACAGGTCA AGCTGACAAG TTTGAAACCG 

551 TTACATCAGG CACAAATGTA ACCTTTGCTA GTGGTAAAGG TACAACTGCG 

601 ACTGTAAGTA AAGATGATCA AGGCAACATC ACTGTTATGT ATGATGTAAA 

651 TGTCGGCGAT GCCCTAAACG TCAATCAGCT GCAAAACAGC GGTTGGAATT 

701 TGGATTCCAA AGCGGTTGCA GGTTCTTCGG GCAAAGTCAT CAGCGGCAAT 

751 GTTTCGCCGA GCAAGGGAAA GATGGATGAA ACCGTCAACA TTAATGCCGG 

801 CAACAACATC GAGATTACCC GCAACGGTAA AAATATCGAC ATCGCCACTT 

851 CGATGACCCC GCAGTTTTCC AGCGTTTCGC TCGGCGCGGG GGCGGATGCG 

901 CCCACTTTGA GCGTGGATGG GGACGCATTG AATGTCGGCA GCAAGAAGGA 

951 CAACAAACCC GTCCGCATTA CCAATGTCGC CCCGGGCGTT AAAGAGGGGG 

1001 ATGTTACAAA CGTCGCACAA CTTAAAGGCG TGGCGCAAAA CTTGAACAAC 

1051 CGCATCGACA ATGTGGACGG CAACGCGCGT GCGGGCATCG CCCAAGCGAT 

1101 TGCAACCGCA GGTCTGGTTC AGGCGTATTT GCCCGGCAAG AGTATGATGG 

1151 CGATCGGCGG CGGCACTTAT CGCGGCGAAG CCGGTTACGC CATCGGCTAC 

1201 TCCAGTATTT CCGACGGCGG AAATTGGATT ATCAAAGGCA CGGCTTCCGG 

1251 CAATTCGCGC GGCCATTTCG GTGCTTCCGC ATCTGTCGGT TATCAGTGGT 

1301 AA 
1 MNKIYRIIWN SALNAWVVVS ELTRNHTKRA SATVKTAVLA TLLFATVQAS 

51 ANTLKAGDNL KIKQFTYSLK KDLTDLTSVG TEKLSFSANG NKVNITSDTK 

101 GLNFAKETAG TNGDTTVHLN GIGSTLTDRA ASVKDVLNAG WNIKGVKNVD 

151 FVRTYDTVEF LSADTKTTTV NVESKDNGKK TEVKIGAKTS VIKEKDGKLV 

201 TGKDKGENGS STDEGEGLVT AKEVIDAVNK AGWRMKTTTA NGQTGQADKF 

251 ETVTSGTNVT FASGKGTTAT VSKDDQGNIT VMYDVNVGDA LNVNQLQNSG 

301 WNLDSKAVAG SSGKVISGNV SPSKGKMDET VNINAGNNIE ITRNGKNIDI 

351 ATSMTPQFSS VSLGAGADAP TLSVDGDALN VGSKKDNKPV RITNVAPGVK 

401 EGDVTNVAQL KGVAQNLNNR IDNVDGNARA GIAQAIATAG LVQAYLPGKS 

451 MMAIGGGTYR GEAGYAIGYS SISDGGNWII KGTASGNSRG HFGASASVGY 

501 QW* 



1 ATGAACAAAA TATACCGCAT CATTTGGAAT AGTGCCCTCA ATGCATGGGT 

51 CGTCGTATCC GAGCTCACAC GCAACCACAC CAAACGCGCC TCCGCAACCG 

101 TGAAGACCGC CGTATTGGCG ACTCTGTTGT TTGCAACGGT TCAGGCAAGT 

151 GCTAACACCC TCAAAGCCGG CGACAACCTG AAAATCAAAC AATTCACCTA 

201 CTCGCTGAAA AAAGACCTCA CAGATCTGAC CAGTGTTGGA ACTGAAAAAT 

251 TATCGTTTAG CGCAAACGGC AATAAAGTCA ACATCACAAG CGACACCAAA 

301 GGCTTGAATT TTGCGAAAGA AACGGCTGGG ACGAACGGCG ACACCACGGT 

351 TCATCTGAAC GGTATTGGTT CGACTTTGAC CGATCGTGCG GCAAGCGTTA 

401 AAGACGTATT AAACGCTGGC TGGAACATTA AAGGCGTTAA AAACGTTGAT 

451 TTCGTCCGCA CTTACGACAC AGTCGAGTTC TTGAGCGCAG ATACGAAAAC 

501 AACGACTGTT AATGTGGAAA GCAAAGACAA CGGCAAGAAA ACCGAAGTTA 

551 AAATCGGTGC GAAGACTTCT GTTATTAAAG AAAAAGACGG TAAGTTGGTT 

601 ACTGGTAAAG ACAAAGGCGA GAATGGTTCT TCTACAGACG AAGGCGAAGG 

651 CTTAGTGACT GCAAAAGAAG TGATTGATGC AGTAAACAAG GCTGGTTGGA 

701 GAATGAAAAC AACAACCGCT AATGGTCAAA CAGGTCAAGC TGACAAGTTT 

751 GAAACCGTTA CATCAGGCAC AAATGTAACC TTTGCTAGTG GTAAAGGTAC 

801 AACTGCGACT GTAAGTAAAG ATGATCAAGG CAACATCACT GTTATGTATG 

851 ATGTAAATGT CGGCGATGCC CTAAACGTCA ATCAGCTGCA AAACAGCGGT 

901 TGGAATTTGG ATTCCAAAGC GGTTGCAGGT TCTTCGGGCA AAGTCATCAG 

951 CGGCAATGTT TCGCCGAGCA AGGGAAAGAT GGATGAAACC GTCAACATTA 

1001 ATGCCGGCAA CAACATCGAG ATTACCCGCA ACGGTAAAAA TATCGACATC 

1051 GCCACTTCGA TGACCCCGCA GTTTTCCAGC GTTTCGCTCG GCGCGGGGGC 

1101 GGATGCGCCC ACTTTGAGCG TGGATGGGGA CGCATTGAAT GTCGGCAGCA 

1151 AGAAGGACAA CAAACCCGTC CGCATTACCA ATGTCGCCCC GGGCGTTAAA 

1201 GAGGGGGATG TTACAAACGT CGCACAACTT AAAGGCGTGG CGCAAAACTT 

1251 GAACAACCGC ATCGACAATG TGGACGGCAA CGCGCGTGCG GGCATCGCCC 

1301 AAGCGATTGC AACCGCAGGT CTGGTTCAGG CGTATTTGCC CGGCAAGAGT 

1351 ATGATGGCGA TCGGCGGCGG CACTTATCGC GGCGAAGCCG GTTACGCCAT 

1401 CGGCTACTCC AGTATTTCCG ACGGCGGAAA TTGGATTATC AAAGGCACGG 

1451 CTTCCGGCAA TTCGCGCGGC CATTTCGGTG CTTCCGCATC TGTCGGTTAT 

1501 CAGTGGTAA 



FIG. 9 



1 50 
H41 MNKIYRIIWN SALNAWVAVS ELTRNHTKRA SATVKTAVLA TLLFATVQAN 
PMC21 MNKIYRIIWN SALNAWVVVS ELTRNHTKRA SATVKTAVLA TLLFATVQAS 
H41Studel MNKIYRIIWN SALNAWVAVS ELTRNHTKRA SATVKTAVLA TLLFATVQAN 
PMC21Bgldel MNKIYRIIWN SALNAWVVVS ELTRNHTKRA SATVKTAVLA TLLFATVQAS 
PMC21C1C5 MNKIYRIIWN SALNAWVVVS ELTRNHTKRA SATVKTAVLA TLLFATVQAS 

C1 



H41 ATDED..SEE ELESVQRS.V VGSIQASMEG SVELET...I SLSMTNDSKE 
PMC21 ANNEEQEEYL YLHPVQRTVA VLIVNSDKEG AGEKEKVEEN SDWAVYFNEK 

H41Studel ATOE...--- TT ~ — 

PMC2lBefldel ANNE ^ ] * 

PMC21C1C5 AN , T ]]'_ w . ][' ]][' 

V1 



101 150 
H41 FVDPYIVVTL KAGDNLKIKQ N.TNSNTNAS SFTYSLKKDL TGLINVETEK 

PMC21 GVLTAREITL KAGDNLKIKQ NGTN...... .FTYSLKKDL TDLTSVGTEK 

H4lStUdel r TGLI fclVETEK 

PMC21Bgld6l * . - . T TDLTSVGTEK 

PMcaicics , , 

V1 C2 V2 C3 

151 200 
H41 LSFGANGKKV NIISDTKGLN FAKETAGTNG DTTVHLNGIG STLTDMLLNT 
PMC21 LSFSANGNKV NITSDTKGLN FAKETAGTNG DTTVHLNGIG STLTDTLLNT 
H41Studel LSFGANGKKV NIISDTKGLN FAKETAGTNG DTTVHLNGIG STLTDMLLNT 
PMC21Bgldel LSFSANGNKV NITSDTKGLN FAKETAGTNG DTTVHLNGIG STLTDTLLNT 

PMC21C1C5 . . * . , 4 

C3 V3 



201 250 
H41 GATTNVTNDN VTDDEKKRAA SVKDVLNAGW NIKGVKPGTT ASDNVDFVRT 
PMC21 GATTNVTNDN VTDDEKKRAA SVKDVLNAGW NIKGVKPGTT ASDNVDFVRT 
H41Studel GATTNVTNDN VTDDEKKRAA SVKDVLNAGW NIKGVKPGTT ASDNVDFVRT 
PMC21Bgldel GATTNVTNDN VTDDEKKRAA SVKDVLNAGW NIKGVKPGTT ASDNVDFVRT 
PMC21C1C5 .......... .......... .......... .......... ...NVDFVRT 

V3 C4 V4 C5 



251 300 
H41 YDTVEFLSAD TKTTTVNVES KDNGKKTEVK IGAKTSVIKE KDGKLVTGKG 
PMC21 YDTVEFLSAD TKTTTVNVES KDNGKKTEVK IGAKTSVIKE KDGKLVTGKD 
H41Studel YDTVEFLSAD TKTTTVNVES KDNGKKTEVK IGAKTSVIKE KDGKLVTGKG 
PMC21Bgldel YDTVEFLSAD TKTTTVNVES KDNGKKTEVK IGAKTSVIKE KDGKLVTGKD 
PMC21C1C5 YDTVEFLSAD TKTTTVNVES KDNGKKTEVK IGAKTSVIKE KDGKLVTGKD 

C5 



301 350 
H41 KGENGSSTDE GEGLVTAKEV IDAVNKAGWR MKTTTANGQT GQADKFETVT 
PMC21 KGENGSSTDE GEGLVTAKEV IDAVNKAGWR MKTTTANGQT GQADKFETVT 
H41Studel KGENGSSTDE GEGLVTAKEV IDAVNKAGWR MKTTTANGQT GQADKFETVT 
PMC21Bgldel KGENGSSTDE GEGLVTAKEV IDAVNKAGWR MKTTTANGQT GQADKFETVT 
PMC21C1C5 KGENGSSTDE GEGLVTAKEV IDAVNKAGWR MKTTTANGQT GQADKFETVT 

C5 



351 400 
H41 SGTKVTFASG NGTTATVSKD DQGNITVKYD VNVGDALNVN QLQNSGWNLD 
PMC21 SGTNVTFASG KGTTATVSKD DQGNITVMYD VNVGDALNVN QLQNSGWNLD 
H41Studel SGTKVTFASG NGTTATVSKD DQGNITVKYD VNVGDALNVN QLQNSGWNLD 
PMC21Bgldel SGTNVTFASG KGTTATVSKD DQGNITVMYD VNVGDALNVN QLQNSGWNLD 
PMC21C1C5 SGTNVTFASG KGTTATVSKD DQGNITVMYD VNVGDALNVN QLQNSGWNLD 

C5 



FIG. 10 



401 450 
H41 SKAVAGSSGK VISGNVSPSK GKMDETVNIN AGNNIEITRN GKNIDIATSM 
PMC21 SKAVAGSSGK VISGNVSPSK GKMDETVNIN AGNNIEITRN GKNIDIATSM 
H41Studel SKAVAGSSGK VISGNVSPSK GKMDETVNIN AGNNIEITRN GKNIDIATSM 
PMC21Bgldel SKAVAGSSGK VISGNVSPSK GKMDETVNIN AGNNIEITRN GKNIDIATSM 
PMC21C1C5 SKAVAGSSGK VISGNVSPSK GKMDETVNIN AGNNIEITRN GKNIDIATSM 

C5 



451 500 
H41 TPQFSSVSLG AGADAPTLSV DDEGALNVGS KDANKPVRIT NVAPGVKEGD 
PMC21 TPQFSSVSLG AGADAPTLSV DG.DALNVGS KKDNKPVRIT NVAPGVKEGD 
H41Studel TPQFSSVSLG AGADAPTLSV DDEGALNVGS KDANKPVRIT NVAPGVKEGD 
PMC21Bgldel TPQFSSVSLG AGADAPTLSV DG.DALNVGS KKDNKPVRIT NVAPGVKEGD 
PMC21C1C5 TPQFSSVSLG AGADAPTLSV DG.DALNVGS KKDNKPVRIT NVAPGVKEGD 

C5 

501 550 
H41 VTNVAQLKGV AQNLNNRIDN VNGNARAGIA QAIATAGLVQ AYLPGKSMMA 
PMC21 VTNVAQLKGV AQNLNNRIDN VDGNARAGIA QAIATAGLVQ AYLPGKSMMA 
H41Studel VTNVAQLKGV AQNLNNRIDN VNGNARAGIA QAIATAGLVQ AYLPGKSMMA 
PMC21Bgldel VTNVAQLKGV AQNLNNRIDN VDGNARAGIA QAIATAGLVQ AYLPGKSMMA 
PMC21C1C5 VTNVAQLKGV AQNLNNRIDN VDGNARAGIA QAIATAGLVQ AYLPGKSMMA 

C5 



551 600 
H41 IGGGTYLGEA GYAIGYSSIS AGGNWIIKGT ASGNSRGHFG ASASVGYQW. 
PMC21 IGGGTYRGEA GYAIGYSSIS DGGNWIIKGT ASGNSRGHFG ASASVGYQW. 
H41Studel IGGGTYLGEA GYAIGYSSIS AGGNWIIKGT ASGNSRGHFG ASASVGYQW. 
PMC21Bgldel IGGGTYRGEA GYAIGYSSIS DGGNWIIKGT ASGNSRGHFG ASASVGYQW. 
PMC21C1C5 IGGGTYRGEA GYAIGYSSIS DGGNWIIKGT ASGNSRGHFG ASASVGYQW. 

C5 



FIG. 10 cont'd 




FIG. 11 




FIG. 12 






FIG. 13 



A 

52 NNEEQEEYL YLHPVQRTVA VLIVNSDKEG AGEKEKVEEN SDWAVYFNEK 

101 GVLTAREITL KAGDNLKIKQ NGTNFTYSLK KDLTDLTSVG TEKLSFSANG 

151 NKVNITSDTK GLNFAKETAG TNGDTTVHLN GIGSTLTDTL LNTGATTNVT 

201 NDNVTDDEKK RAASVKDVLN AGWNIKGVKP GTTASDNVDF VRTYDTVEFL 

251 SADTKTTTVN VESKDNGKKT EVKIGAKTSV IKEKDGKLVT GKDKGENGSS 

301 TDEGEGLVTA KEVIDAVNKA GWRMKTTTAN GQTGQADKFE TVTSGTNVTF 

351 ASGKGTTATV SKDDQGNITV MYDVNVGDAL NVNQLQNSGW NLDSKAVAGS 

401 SGKVISGNVS PSKGKMDETV NINAGNNIEI TRNGKNIDIA TSMTPQFSSV 

451 SLGAGADAPT LSVDGDALNV GSKKDNKPVR ITNVAPGVKE GDVTNVAQLK 

501 GVAQNLNNRI DNVDGNARAG IAQAIATAGL VQAYLPGKSM MAIGGGTYRG 

551 EAGYAIGYSS ISDGGNWIIK GTASGNSRGH FGASASVGYQ W* 
GLNFAKETAG TNGDTTVHLN 
RAASVKDVLN AGWNIKGVKP 
VESKDNGKKT EVKIGAKTSV 
KEVIDAVNKA GWRMKTTTAN 
SKDDQGNITV KYDVNVGDAL 
PSKGKMDETV NINAGNNIEI 
LSVDDEGALN VGSKDANKPV 
IDNVNGNARA GIAQAIATAG 
SISAGGNWII KGTASGNSRG 



LETISLSMTN DSKEFVDPYI 
KDLTGLINVE TEKLSFGANG 
GIGSTLTDML LNTGATTNVT 
GTTASDNVDF VRTYDTVEFL 
IKEKDGKLVT GKGKGENGSS 
GQTGQADKFE TVTSGTKVTF 
NVNQLQNSGW NLDSKAVAGS 
TRNGKNIDIA TSMTPQFSSV 
RITNVAPGVK EGDVTNVAQL 
LVQAYLPGKS MMAIGGGTYL 
HFGASASVGY QW* 
52 TDETGLINV ETEKLSFGAN GKKVNIISDT KGLNFAKETA GTNGDTTVHL 

101 NGIGSTLTDM LLNTGATTNV TNDNVTDDEK KRAASVKDVL NAGWNIKGVK 

151 PGTTASDNVD FVRTYDTVEF LSADTKTTTV NVESKDNGKK TEVKIGAKTS 

201 VIKEKDGKLV TGKGKGENGS STDEGEGLVT AKEVIDAVNK AGWRMKTTTA 

251 NGQTGQADKF ETVTSGTKVT FASGNGTTAT VSKDDQGNIT VKYDVNVGDA 

301 LNVNQLQNSG WNLDSKAVAG SSGKVISGNV SPSKGKMDET VNINAGNNIE 

351 ITRNGKNIDI ATSMTPQFSS VSLGAGADAP TLSVDDEGAL NVGSKDANKP 

401 VRITNVAPGV KEGDVTNVAQ LKGVAQNLNN RIDNVNGNAR AGIAQAIATA 

451 GLVQAYLPGK SMMAIGGGTY LGEAGYAIGY SSISAGGNWI IKGTASGNSR 

501 GHFGASASVG YQW* 
52 NNVDFVRTY DTVEFLSADT KTTTVNVESK DNGKKTEVKI GAKTSVIKEK 

101 DGKLVTGKDK GENGSSTDEG EGLVTAKEVI DAVNKAGWRM KTTTANGQTG 

151 QADKFETVTS GTNVTFASGK GTTATVSKDD QGNITVMYDV NVGDALNVNQ 

201 LQNSGWNLDS KAVAGSSGKV ISGNVSPSKG KMDETVNINA GNNIEITRNG 

251 KNIDIATSMT PQFSSVSLGA GADAPTLSVD GDALNVGSKK DNKPVRITNV 

301 APGVKEGDVT NVAQLKGVAQ NLNNRIDNVD GNARAGIAQA IATAGLVQAY 

351 LPGKSMMAIG GGTYRGEAGY AIGYSSISDG GNWIIKGTAS GNSRGHFGAS 

401 ASVGYQW* 